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Abstract — We consider data losses in a single node of a packet- 
switched Internet-like network. We employ two distinct models, 
one with discrete and the other with continuous one-dimensional 
random walks, representing the state of a queue in a router. Both 
models have a built-in critical behavior with a sharp transition 
from exponentially small to finite losses. It turns out that the 
finite capacity of a buffer and the packet-dropping procedure 
give rise to specific boundary conditions which lead to strong 
loss rate fluctuations at the critical point even in the absence of 
such fluctuations in the data arrival process. 

I. Introduction 

Many systems, both natural and man-made are organized 
as complex networks of interconnected entities: brain cells 
IT], interacting molecules in living cells Q, multi-species 
food webs [3|, social networks [4| and the Internet [5| are 
just a few examples. In addition to the classical Erdos-Renyi 
model for random networks [6|, new overarching models of 
scale-free [7| or small-world [8| networks turn out to describe 
real world examples. These and other network models have 
received extensive attention by physicists (see Refs. J9), ifTOl 
for reviews). 

A particularly interesting problem for a wide range of com- 
plex networks is their resiliency to breakdowns. The possibility 
of random or intentional breakdowns of the entire network has 
been considered in the context of scale-free networks where 
nodes were randomly or selectively removed ifTTl . 1121 . 1131 . 
or in the context of small-world networks where a random 
reduction in the sites' connectivity leads to a sharp increase in 
the optimal distance across network which destroys its small- 
world nature fl3l . OH, lfT31 . In all these models, the site or 
bond disorder acts as an input which makes them very general 
and applicable to a wide variety of networks. 

Network breakdowns can result not only from a physical 
loss of connectivity but from an operational failure of some 
network nodes to forward data. In the more specific class of 
communication networks, this could happen due to excessive 
loading of a single node. This could trigger cascades of failures 
and thus isolate large parts of the network |16|. In describing 
the operational failure in a particular network node, one needs 
to account for distinct features of the dynamically 'random' 
data traffic which can be a reason for such a breakdown. 
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In this paper we model data losses in a single node of 
a packet-switched network like the Internet. There are two 
distinct features which must be preserved in this case: the 
discrete character of data propagation and the possibility of 
data overflow in a single node. In the packet-switched network 
data is divided into packets which are routed from source to 
destination via a set of interconnected nodes (routers). At each 
node packets are queued in a memory buffer before being 
serviced, i.e. forwarded to the next node. (There are separate 
buffers for incoming and outgoing packets but we neglect 
this for the sake of simplicity). Due to the finite capacity of 
memory buffers and the stochastic nature of data traffic, any 
buffer can become overflown which results in packets being 
discarded. 

We consider a model where noticeable data losses in a single 
memory buffer start when the average rate of random packet 
arrivals approaches the service rate. Under this condition 
the model has a built-in sharp transition from free flow to 
lossy behavior with a finite fraction of arriving packets being 
dropped. A sharp onset of network congestion is familiar to 
everyone using the Internet and was numerically confirmed in 
different models ATI . Here we stress that such a congestion 
can originate from a single node. 

While data loss is natural and inevitable due to the data 
overflow, we show that loss rate statistics turn out to be highly 
nontrivial in the realistic case of a finite buffer, where at the 
critical point the magnitude of fluctuations can exceed the 
average value, while they obey the central limit theorem only 
in the (unrealistically) long time limit. Such an importance of 
fluctuations in some intermediate regime is a definitive feature 
of mesoscopic physics, albeit the reasons for this are absolutely 
different (note that even in the case of electrons, the origin of 
the mesoscopic phenomena can be either quantum or purely 
classical, see, e.g., [18]). Although we model data arrivals as a 
Markovian process, the loss rate at intermediate times shows 
long-range power-law correlations in time. When excessive 
data losses start, it is more probable that they persist for a 
while, thus impacting on network operation. 

The average loss rate and/or transport delays were previ- 
ously studied, e.g., in the theories of bulk queues [19], l20l 
or a jamming transition in traffic flow [21 1. What makes 
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Fig. 1. The model of data losses: incoming packets randomly arrive in 
discrete time intervals and join the queue of length I limited by the memory 
buffer capacity L. Packets in front of the queue are served at the same time 
intervals. If the queue sticks to the boundary, newly arriving packets are 
discarded. 

present considerations qualitatively different is that we analyze 
fluctuations of a discrete quantity, the number of discarded 
packets. Although fluctuations in network dynamics were pre- 
viously studied (see, e.g. [22]), this was done in the continuous 
limit for the data traffic, through measurements or numerical 
simulations. 

II. The Discrete Model 

The mode of operation of a memory buffer is that packets ar- 
rive randomly, form a queue in the buffer and are subsequently 
serviced. Each packet in the queue has typically a variable 
length and is normally serviced in fixed-length service units 
at discrete time intervals on a first-in first-out basis. Here we 
choose the simplest non-trivial model of this class: (i) packets 
have a fixed length of two service units; (ii) arrival and service 
intervals coincide. The length of the queue after n service 
intervals, £ n , serves as a dynamical variable which obeys the 
discrete-time Langevin equation, 

C+i (1) 

where the telegraph noise is defined by 

, J 1, 0<£ n <L-l 

£„ = < £ = L probability p 

(2) 

£n = I _ 1 < 1 £~<'l with probability 1 ~P 

The above means that the length of the queue, measured in 
service units, either increases by one when one packet arrives 
and one service unit is served, or decreases by one when no 
packet arrives. The boundary conditions above correspond to 
discarding a newly arrived packet when buffer is full (£ n = L) 
and to an idle interval when no packet arrives at an empty 
buffer (£ n = 0). In a more general, continuous model we 
will remove these restrictions, allowing for arbitrary quasi- 
Markovian nature of the input data traffic. The characteristic 
features of our results would not change. 

The main quantity which characterizes congestion is the 
packet loss rate which is defined via the number of packets 



discarded during a time interval N by 

no+N 

£iv(n ) = Si niL 5i n+li L ■ (3) 

n— no + 1 

This means that the packet is discarded if by the moment 
of its arrival the queue was at the maximal capacity L as 
illustrated in Fig. Q] Thus the loss rate (01 is defined entirely 
by the processes at the boundary of the random walk (RW) so 
that the continuous limit cannot be exploited. This makes the 
loss statistics profoundly different from, e.g., the thoroughly 
studied statistics of first-passage time. 

We will consider the average and the variance of the loss 
rate, which we obtain directly from Eq. (01: 

(£ N ) =V st (L)Ng 1 (L,L), (4) 

n +N 
n,m— no + 1 

= (C N ) +2T st (L)g 2 1 {L 1 L) G m -n-i(L,L). (5) 

n<m 

where the conditional probability of the queue being of length 
I at time n provided that it was of length I' at time n is 
defined by 

Here (...) stand for the averaging over the telegraph noise of 
Eqs. (HJ - (fJJ. The stationary distribution of the queue length 
is related to Q by 

V st {£)= lim Q n -n (t,t) = {8t n ,l) ■ (6) 

no — > — oo 

After relatively straightforward calculations ll23l . we find and 
expected (actually, built-in) critical behavior of the average: 

f 2p-l, P > |; 

i L+l _ L 

l-2p „L „ - 1. 



where q = p/(l —p). Thus the loss rate for p > 1/2 is of order 
1, for p = 1/2 a small fraction of the buffer capacity and for 
p < 1/2 an exponentially vanishing function, as expected. The 
matching between the three asymptotic regimes takes place in 
a narrow region (of width ~ 1 /Li) around p = i . 

The result for the variance is far more interesting, as it 
shows significant fluctuations in the loss rate. It is convenient 
to express the variance in terms of the 'compressibility' 
defined by 

(SC 2 N ) = XN (Cn) , 8C N (n) = C N {n) - (C N ) . (8) 

The compressibility has been exactly calculated in ||23l . Its 
behavior of x lS illustrated in Fig. [2] which shows its fast 
increase at the critical point, p = 1/2. 



This long-time correlation (in spite of the packet arrival being 
Markovian) is another clear sign of criticality. 



III. The Continuous Model 



30- 



20 



10- 



0> 
1000 



600 



200 




2p-l 



Fig. 2. compressibility \ (f° r L> = 1000) shows a fast increase of fluctuations 
in time at the critical point, p = 1/2. 



For N > N = [(2p - l) 2 + (n/L) 2 ] , i.e. in the 
limiting steady-state regime the compressibility saturates at 

-|2p-l| |< 

Xoo = S (9) 
|2p- 



2r 
3^' 



1|£<1 



Although it diverges in the thermodynamic limit, L — > oo and 
N/L 2 — > oo, at the transition point p = 1/2, the variance © 
in this limit remains finite and obeys the central limit theorem. 

However, this limit is reachable at the critical point only for 
unrealistically long times N^NqocL 2 . In the intermediate, 
practical regime, 1 <C <C No, the compressibility rapidly 
increases with time: 



XN 




(10) 



so that the variance exceeds the average value of the loss-rate 
and, as can be shown, its distribution is no longer normal. 

It is even more interesting that in this regime the fluctuations 
of the loss rate are no longer Markovian as they exhibit long- 
time correlations. To show this, we consider the temporal 
correlation function of the loss rate defined by 

(SC N (0) SC N (M)) 



R 2 (N, M) 



{SC%) 



M > N . 



In the most relevant regime, Nq ^> N ^> 1 and M > N, this 
has been calculated to give 



R 2 {N,M) 



pN 

XN 



-M(2p-l) 2 /2 



7rM 



- |2p- l|erfc(j2p- l\^/M/2 
At the critical point this reduces using Eq. (TTOb to 



R 2 (N,M)\ p=1 



P=l/2 



N 
2ttM 



(11) 



(12) 



Consider the output buffer attached to the switching device 
in the router. The speed of the input line of the buffer 
(effectively, it is the capacity of the switching fabric) would 
normally be much bigger than the speed of the output line 
of the buffer. Hence, we can consider the packet arrival as 
an instantaneous process. Packets arrivals are treated as a 
renewal process. The capacity of the buffer is L (measured 
in bits). The lengths of arriving packets are treated as random 
(measured in bits). The service (the capacity of the output 
link) is considered to be deterministic. Note that the packets 
are put in frames before being sent to the output link, the 
corresponding randomness (due to the randomness of packet 
size) is neglected because it is much smaller of the randomness 
of the arrival process (the overheads are much smaller than the 
typical size of the packet). We normalise the lengths of packets 
p, the speed of the output link r out and the queue length I by 
the size of the buffer L (the size of the buffer is then set to 
be 1). 

The procedure is the following: assume that at the moment 
of arrival of a packet of size p, the state of the queue is £, this is 
followed by the gap 77 (random inter-arrival time) till the next 
arrival. If I + p < 1 then the packet joins the queue and the 
queue length prior the next arrival is £' = £+p—r]/r]o if £' > 
and £' = otherwise. If £+p > 1 then the packet is discarded 
and the queue length prior the next arrival is £' — I — rj/r/o if 
£' > and £' = otherwise. Here we introduced the following 
time scale 

(13) 



1 

Tout 



the time required to empty a full buffer provided there are no 
new arrivals. 

Assuming that the maximum packet size is much less than 
1 (the buffer size) and the average incoming traffic rate r m 
(also normalised to the buffer size) is close to the service rate: 



\VinV0 ~ 1| < 1 



(14) 



we can treat p, rj and £ as continuous variables. 

Our aim is to calculate the statistics of the amount of the 
dropped traffic and the service lost due to idleness of the output 
link during time £ ^> 77 (77 is the average inter-arrival time) in 
the regime ( fl4l . 

In the regime ( fl4l > and for observation times t 3> fj, 
the system can be described by the Fokker-Planck equation 
as follows (in terms of the transitional probability density 
function w(£', t; £)) 

d t w(i', t; £) = -ad e >w(l', t; £) + ~a 2 d%,w(l', t; £) , (15) 

where a and a 2 are average moments of the change of the 
queue size per unit time 

1 , . o 1 



At 



(A£) 



At 



(A£ 



At -> (16) 



and the following boundary and initial conditions are imposed 

^'M)Uo,i=0> (17) 
w(£',t-£)\ t=Q = 5{£' -£) (18) 

where 

J(£', t: t) = aw{£', t; I) - -a 2 d e ,w(£', t; £) , (19) 

is the probability current. By At — > in eq. (fTSI l we mean that 
it is much smaller than the observation time, but large enough 
so that the underlying stochastic processes can be considered 
as continuous: 

fj < At < t (20) 



The solution of (115117118b can be expressed as follows 

k=1 (21) 
x [27rfccos(27rfc^) + vs\tl(2M')} 

x [27rfc cos(2M) + v sin(27rfc£)] 



where 



a o t 

V — — o , T = 



(22) 

Note that the solution (l2"TT i can be expressed in terms of 0- 
functions. 

For the Laplace transform of w(£', t; £) we have 

1 v{t' — t) 

W(i',e;£) = /: T w(£',t;£) = --^ 



2 resinh(ft) 

x J — coshkf/ +£-l)] + — sinh[^' + * - 1)1 
e e 



+ cosh[ K (|f - £| - 1)] + cosh[K(T + £ - 1)] 



where 



K = \/e + u 2 



(23) 



(24) 



From d23l we have for the probabilities of returning to the 
boundaries 



W(0, e; 0) = - [KCotanh(K) - v] 
e 

W(l, e; 1) = - [KCotanh(K) + v] 
e 



(25) 



These will be used in the next section. 

IV. Statistics of losses 

In this section we will concentrate on the statistics of the 
losses due to the buffer overflowing. The corresponding for- 
mulae for the statistics of the server idleness can be obtained 
using transformation £ — > 1 — £, v — * —v. 

First, we estimate the size of fluctuations of the losses on 
time scale t <C 2/cr 2 . In order to do that we consider the 



dynamics of the system near the boundary I = 1 which is 
governed by the following transitional probability: 



w (£',t;£) 



1 



x -j^exp 
a 



\J2-koH 
(£' - £) 2 



exp 



■ exp 



2aH 
2a(l-£') 



\- exp 
crfc 



a(£! - £) a 2 t 
(2-1' - If 



2oH 
£' - £ + at 



(26) 



V2aH 



which is the solution of ( fTBI l when the boundary £ = is sent 
to — oo. The change in the state of the system during time t 
can then be represented as follows: 



A£(t) =£'-£= A£ {t) + A£ loss (£', t; £) 



(27) 



where A£$(t) is the change in the state of the system if there 
was no boundary, its statistics is determined by 



(A* (t)) = at , ([A£ {t)} 2 ) = a 2 t + o(t) 



(28) 



and A£i oss (£' ,t;£) is the amount of traffic lost due to buffer 
overflowing. The moments of d28l can be defined as follows 



([A£(t)] n ) = / d£'d£ {£' - £) n w (f, t- £) P (£) 



(29) 



where p(£) is the stationary distribution. 

For the first two moments d29b in the limit f->0we have 



(A£(t))=at+^p(l), ([A£(t)} 2 ) 



(30) 



From ( 127128130b we can conclude that 

a 2 t 



(A4»(t)> 



-p(l) 



2 (31) 

([A^ loss (<)] 2 ) + 2(A£ (t)A£ loss (t)) = o(t) 

The first of the relations OTb means that A£i oss (^', t; £) is non- 
zero only if £',£ <~ 1 in the limit t — > 0. The second relation 
means either 

([A^ loss (i)] 2 ), (Ae (t)A£ loss {t)) = o(t) (32) 



A£ loss {t) = -2A£ (t) + o(Vt) (33) 

The relation d33l does not make sense physically, so in what 
follows we accept option (l32l and show that it is consistent 
with the later calculations. 

Next we lift the restriction t <C 2/cr 2 . It can be shown that 
the conditional moments (with the condition that the system 
was in the state I at the beginning of the observation interval) 
can be expressed as follows: 



^i ( oi(*; (■) = k\rt ss J] / dti [] w(l, t j+1 - t r , 1) 
»=i { j=i 



(34) 



X 10(1,*!^) , tk+1 



where w(£',t;£) is determined by ( f2TT > and 

rioss = lim i f d£' j d£ A£ loss [£', t; £) 



(35) 



1 C ^ 

= lim - / d£'d£ (£'-£- at)w (£', t; £) = y 

— OO 

For unconditional moments in the stationary regime we have 



™£s(*) = jd£m^Jt;£)p(£) 
o 

fe Tl + X fe-i 



(36) 



fc! IT / dr i n^( 1 ^ J +i-^; 1 )-P( 1 ) 

„' 1 J A 1 



i=l „ j=l 



where r is defined in (1221 and p{£) is the stationary solution 
of (IB): 

2ve 2 ^ 



pW 



e 2i, _ ! 



(37) 



To calculate m[^l s (t) we consider its Laplace transform: 



M£> (e) = /V^ft) = / dr e-^ft) 



(38) 



fe!p(l)[W(l,e;l)] fc - 1 4 



where W(l, e; 1) is is defined by ( |23l . From fl38l we obtain 

<7 2 f 



"^(*)=P(1K = P(1) : 



(39) 



For the moments d38l with fc > 1 we can identify the following 
regimes: 

^lossW - U Ip fc (1)e -(fc+l) e<<1 (4U) 



Correspondingly, for the moments in ^-representation we have 

(41) 



m&w = { mi) w^m T<<1 

^p k (l)T k r>l 



Now we turn our attention to the calculation of the PDF 
Pioss(x;t) of the amount of the lost traffic, x, during time t. 
To calculate it we consider its characteristic function in the 
e-representation: 



floss ^) — -^a; floss {x 1 ^) 

Pioss (x; e) = £ T p\ oss (x;t) 
From d42b we obtain 



(42) 



OO / \ „ 

k=o ' i 



Pi oss (x;t) 



OO / 



(43) 



where 



OO 

floss(e) = £ T Pio SS (t) , Pioss{t) = J dx p loss (x,t) (44) 



with 1 — p\ oss (t) being the probability for the system not to 
drop a single packet over the period of time t. Substituting 
(138T > into d43j we have 



floss(s; e) = P loS s(e) + ^ J2(~ S ) k [W(l, e; l)]*" 1 



floss (e) + 



P(l) 



e 2 W(l,e;l) 



fe=i 
-1 - 



l + sW(l,e;l) 



In order that P\ oss (s; e) did not have an abnormal behaviour 
(in particular, it did not contain terms like 5(x)), we must 
assume that 

p(1) 



Hence, 

floss (x; e) 



floss (e) 



P(l) 



e 2 W(l,e;l) 



exp 



W(l,e;l) 



(45) 



(46) 



e 2 W 2 (l,e;l) 

Integrating this relation over x, we recover d45l ), which shows 
that our assumption is indeed correct. 

In the regimes of short and long times we have 



Pk> ss (x;t) 



and 



p(l)erfc 

X — TJ>(1) 



T < 1 
T > 1 



(47) 



Ploss(t) = 



p(l 



1 



r«l 



r > 1 



(48) 



The conditional PDF (with the condition that the system 
dropped at least one packet during the time t) can be defined 
as follows 



P\ oss (x;t) 

Ploss(*) 



-erfc 



x — rp(l) 



T < 1 



T > 1 



(49) 

Now let us compare the results of the continous approach 
with those of Section II. To make the comparison, we calculate 
the central moments of losses in a similar way as the uncon- 
ditional ones in Eq. d36l ). Here we will consider the variance 
of the losses crf oss (t) only in the limit r ^> 1: 



'loss 



-cotanh|u| — sinh 



(*) 



M < l 



fc=i 



^^os'sW M»l 



(50) 



In this long-time limit the ratio of the variance to the square 
of the average vanishes, so that the distribution of data losses 
obeys the central limit theorem, as also seen from the second 
line of Eq. d49l i. This is essentially in agreement with the 
result of the compressibility \oo in ll23l . Naturally, the present 
considerations are much more general as we have not imposed 
any artificial limitations on the random input traffic. 

Finally, we calculate the correlator of the fluctuations of 
losses measured during two time intervals of length t% and t 2 
correspondingly and separated by the time T: 

1 

corvih , ta , T) = Jd£ p(t u t 2 ,T)- m« (t , )m^ s (t 2 ) 
o 

where 

p(t 1 ,t 2 ,T) 

ti t 2 

= rLs J <J d<2 w(l, t[ + t 2 -t' 2 + T; l)p(l) 



with ri oss defined in d35l ). 

In the regime T t\it 2 and T ^> 2/cr 2 it can be shown 
that 

corr(ti,t 2) T) -» , (51) 

T — >oo 

as we would expect. In fact, the correlator goes to zero expo- 
nentially if v 7^ 0. In the opposite regime 2/cr 2 3> T ^> t\ , t 2 
we have 

corrfa, t 2 , T) = (tx)m« (52) 

which is again in agreement with the results of the discrete- 
time considerations of Section II, showing the universality of 
the present approach. 

V. Discussion and Conclusion 

As one would expect intuitively, loss events separated 
widely in time are uncorrelated as shown by equation dBTl l. By 
widely separated in time, we mean that the time separation of 
the two observation intervals in which losses occur is much 
longer than the time over which fluctuations of queue length 
become comparable or much bigger than the buffer size itself, 
i.e. 2/cr 2 . 

However, in the case when the separation time is much 
smaller than 2/tr 2 , the correlations of loss fluctuations are 
decaying very, very slowly, as can be seen from equation j52l . 
Such time intervals are likely to be comparable or even smaller 
than the round trip times for typical TCP connections. TCP is 
the protocol that controls the rate at which data is sent across 
a network, between a particular source and destination. The 
exact details of the congestion control operation of TCP can 
be found in 11241 . 

Considerations of losses in a network, rather than in a 
single buffer, would require knowledge of the distribution and 
correlations of data traffic through different buffers comprising 
the nodes. The two input parameters, a and a 2 in Eq. (fTol l 



for a single buffer, are determined by the network topology, 
the routing protocol, and the external input traffic distribution 
to the network. Of course, a detailed knowledge of all these 
parameters is never available for a realistic network. We will 
consider an analytically tractable albeit a simplified model 
with a homogeneous external traffic (all flows from any source 
to any destination are considered statistically equivalent). 
Then the above individual single-buffer input parameters are 
straightforwardly connected to the number of flows passing 
through the appropriate buffer. This number, in turn, depends 
on the topology, the protocol and the external load and is 
equal to the link-betweenness of the corresponding buffer. 
Fortunately for our considerations the distribution of these 
quantities are empirically known through measurements on 
the Internet (25). This allowed us to analyze fluctuations and 
temporal correlations of losses in a realistic model of the 
Internet |26l . 
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